Confidence measures for keyword spotting using support vector machines
نویسندگان
چکیده
Support Vector machines (SVM) is a new and very promising classification technique developed from the theory of Structural Risk Minimisation [1]. In this paper, we propose an alternative out-of-vocabulary word detection method relying on confidence measures and support vector machines. Confidence measures are computed from phone level information provided by a Hidden Markov Model (HMM) based speech recognizer. We use three kinds of average techniques as arithmetic, geometric and harmonic averages to compute a confidence measure for each word. The acceptance/rejection decision of a word is based on the confidence feature vector which is processed by a SVM classifier. The performance of the proposed SVM classifier is compared with methods based on the averaging of confidence measures.
منابع مشابه
Recognition and Rejection Performance in Wordspotting Systems Using Support Vector Machines
Support Vector Machines (SVM) is one such machine learning technique that learns the decision surface through a process of discrimination and has a good generalization capacity [6]. SVMs have been proven to be successful classifiers on several classical pattern recogntion problems [9, 11]. In this paper, one of the first applications of Support Vector Machines (SVM) technique for the problem of...
متن کاملSemi-Supervised Keyword Spotting in Arabic Speech Using Self-Training Ensembles
Arabic speech recognition suffers from the scarcity of properly labeled data. In this project, we introduce a pipeline that performs semi-supervised segmentation of audio then— after hand-labeling a small dataset—feeds labeled segments to a supervised learning framework to select, through many rounds of hyperparameter optimization, an ensemble of models to infer labels for a larger dataset; usi...
متن کاملKeyword Spotting Using Normalization of Posterior Probability Confidence Measures
Keyword Spotting Using Normalization of Posterior Probability Confidence Measures by Rachna Vijay Vargiya Thesis Advisor: Marius C. Silaghi, Ph.D. Keyword spotting techniques deal with recognition of predefined vocabulary keywords from a voice stream. This research uses HMM based keyword spotting algorithms for this purpose. The three most important componenets of a keyword detection system are...
متن کاملبهبود کارایی سیستم کاوشگر کلمات تلفنی با استفاده از نرمالیزاسیون امتیاز اطمینان مبتنی بر روش برنامهریزی خطی
Conventional word spotting systems determine hypothesized keywords and their confidence score using a speech recognizer. Acceptance or rejection of these keywords is intended based on comparison of their scores with a specific threshold. It has been proved that confidence score prepared by recognizer is highly dependent on sub-word structure of each keyword. So comparing assigned scores to keyw...
متن کاملImproving performance of a keyword spotting system by using a new confidence measure
This work describes a HMM-based keyword spotting system. In this system, keywords are modeled as concatenations of the corresponding phoneme models, consequently, no specific databases are needed to train the system. In addition no filler models are required, therefore small computational requirements are necessary. Two main stages define the whole system. The first stage is based on a previous...
متن کامل